The Role of Audio and Tags in Music Mood Prediction: A Study Using Semantic Layer Projection

Authors

Pasi Saari

Tuomas Eerola

György Fazekas

Mathieu Barthet

Olivier Lartillot

Mark B. Sandler

Abstract

Semantic Layer Projection (SLP) is a method for automatically annotating music tracks according to expressed mood based on audio. We evaluate this method by comparing it to a system that infers the mood of a given track using associated tags only. SLP differs from conventional auto-tagging algorithms in that it maps audio features to a low-dimensional semantic layer congruent with the circumplex model of emotion, rather than training a model for each tag separately. We build the semantic layer using two large-scale data sets – crowd-sourced tags from Last.fm, and editorial annotations from the I Like Music (ILM) production music corpus – and use subsets of these corpora to train SLP for mapping audio features to the semantic layer. The performance of the system is assessed in predicting mood ratings on continuous scales in the two data sets mentioned above. The results show that audio is in general more efficient in predicting perceived mood than tags. Furthermore, we analytically demonstrate the benefit of using a combination of semantic tags and audio features in automatic mood annotation.
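The two-stage idea described in the abstract (first build a low-dimensional semantic layer from tag data, then learn a mapping from audio features to that layer) can be illustrated with a minimal sketch. This is not the authors' implementation: the truncated SVD and the closed-form ridge regression are stand-in techniques chosen for brevity, the data are synthetic, and all function names, dimensions, and parameters below are hypothetical.

```python
import numpy as np


def build_semantic_layer(tag_matrix, n_dims=2):
    """Project a (tracks x tags) matrix onto n_dims axes with a truncated SVD.

    Assumption: SVD stands in for whatever dimensionality reduction
    the original study uses to obtain its semantic layer.
    """
    centered = tag_matrix - tag_matrix.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    return u[:, :n_dims] * s[:n_dims]


def fit_audio_to_layer(audio_features, layer_coords, ridge=1.0):
    """Fit a ridge regression from audio features to layer coordinates."""
    mean_x = audio_features.mean(axis=0)
    mean_y = layer_coords.mean(axis=0)
    x = audio_features - mean_x
    y = layer_coords - mean_y
    # Closed-form ridge solution: (X^T X + ridge * I)^-1 X^T Y
    gram = x.T @ x + ridge * np.eye(x.shape[1])
    weights = np.linalg.solve(gram, x.T @ y)
    return weights, mean_x, mean_y


def predict_layer(audio_features, model):
    """Map audio features of new tracks to semantic-layer coordinates."""
    weights, mean_x, mean_y = model
    return (audio_features - mean_x) @ weights + mean_y


# Synthetic data (hypothetical): a hidden 2-D mood factor drives both
# the tag counts and the audio features of each track.
rng = np.random.default_rng(0)
n_tracks, n_tags, n_audio = 200, 30, 12
latent = rng.normal(size=(n_tracks, 2))
tag_matrix = latent @ rng.normal(size=(2, n_tags)) \
    + 0.1 * rng.normal(size=(n_tracks, n_tags))
audio = latent @ rng.normal(size=(2, n_audio)) \
    + 0.1 * rng.normal(size=(n_tracks, n_audio))

# For simplicity the layer is built from all tracks; the regression is
# trained on the first 150 tracks and applied to the remaining 50.
layer = build_semantic_layer(tag_matrix, n_dims=2)
model = fit_audio_to_layer(audio[:150], layer[:150], ridge=1.0)
predicted = predict_layer(audio[150:], model)
```

In this sketch a single regression targets the shared layer, in contrast to training one model per tag, which is the distinction the abstract draws between SLP and conventional auto-tagging.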

Related papers

Knowledge Management On The Semantic Web: A Comparison of Neuro-Fuzzy and Multi-Layer Perceptron Methods For Automatic Music Tagging

This paper presents the preliminary analyses towards the development of a formal method for generating autonomous, dynamic ontology systems in the context of web-based audio signals applications. In the music domain, social tags have become important components of database management, recommender systems, and song similarity engines. In this study, we map the audio similarity features from the ...

Music Mood Representations from Social Tags

This paper presents findings about mood representations. We aim to analyze how people tag music by mood, to create representations based on this data, and to study the agreement between experts and a large community. For this purpose, we create a semantic mood space from last.fm tags using Latent Semantic Analysis. With an unsupervised clustering approach, we derive from this space an ideal c...

Audio Mood Classification Using Ensemble Classifier with Music Tag Based Indexing

This paper presents a system for audio classification using multiple binary ensemble classifiers with music tag based indexing in a one-versus-all classification scenario. The proposed system has won the audio classification task on mood dataset in MIREX 2010 and is implemented as follows. First, in the training phase, the frame-based 70-dimensional feature vectors are extracted from a training...

Mirex 2011: Audio Tag Classification Using Weighted-vote Nearest Neighbor Classification

In this long abstract, we present an algorithm for automatically annotating music with tags that is fast, scalable and relatively easy to implement. It uses acoustic similarity for propagating tags among audio items. The algorithm makes use of a variety of acoustical features, ranging from spectral features to rhythm, tonal and high-level features (such as mood, genre, gender). These features a...

Multi-Tasking with Joint Semantic Spaces for Large-Scale Music Annotation and Retrieval

Music prediction tasks range from predicting tags given a song or clip of audio, predicting the name of the artist, or predicting related songs given a song, clip, artist name or tag. That is, we are interested in every semantic relationship between the different musical concepts in our database. In realistically sized databases, the number of songs is measured in the hundreds of thousands or m...

Publication date: 2013
